On Equivalence of Martingale Tail Bounds and Deterministic Regret Inequalities

نویسندگان

Alexander Rakhlin

Karthik Sridharan

چکیده

We study an equivalence of (i) deterministic pathwise statements appearing in the online learning literature (termed regret bounds), (ii) high-probability tail bounds for the supremum of a collection of martingales (of a specific form arising from uniform laws of large numbers for martingales), and (iii) in-expectation bounds for the supremum. By virtue of the equivalence, we prove exponential tail bounds for norms of Banach space valued martingales via deterministic regret bounds for the online mirror descent algorithm with an adaptive step size. We extend these results beyond the linear structure of the Banach space: we define a notion of martingale type for general classes of real-valued functions and show its equivalence (up to a logarithmic factor) to various sequential complexities of the class (in particular, the sequential Rademacher complexity and its offset version). For classes with the general martingale type 2, we exhibit a finer notion of variation that allows partial adaptation to the function indexing the martingale. Our proof technique rests on sequential symmetrization and on certifying the existence of regret minimization strategies for certain online prediction problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Concentration Inequalities for Dependent Random Variables via the Martingale Method

Abstract: We use the martingale method to establish concentration inequalities for a class of dependent random sequences on a countable state space, with the constants in the inequalities expressed in terms of certain mixing coefficients. Along the way, we obtain bounds on certain martingale differences associated with the random sequences, which may be of independent interest. As an applicatio...

متن کامل

A Trajectorial Interpretation of Doob’s Martingale Inequalities

We present a unified approach to Doob’s Lp maximal inequalities for 1 ≤ p < ∞. The novelty of our method is that these martingale inequalities are obtained as consequences of elementary deterministic counterparts. The latter have a natural interpretation in terms of robust hedging. Moreover our deterministic inequalities lead to new versions of Doob’s maximal inequalities. These are best possib...

متن کامل

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

We consider an upper confidence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the online regret (with respect to an (ε-)optimal policy) that are logarithmic in the number of steps taken. These bounds also match known asymptotic bounds for the general MDP setting. We also present corresponding lower bounds. As an...

متن کامل

On joint distributions of the maximum, minimum and terminal value of a continuous uniformly integrable martingale

We study the joint laws of a continuous, uniformly integrable martingale, its maximum, and its minimum. In particular, we give explicit martingale inequalities which provide upper and lower bounds on the joint exit probabilities of a martingale, given its terminal law. Moreover, by constructing explicit and novel solutions to the Skorokhod embedding problem, we show that these bounds are tight....

متن کامل

Some Probability Inequalities for Quadratic Forms of Negatively Dependent Subgaussian Random Variables

In this paper, we obtain the upper exponential bounds for the tail probabilities of the quadratic forms for negatively dependent subgaussian random variables. In particular the law of iterated logarithm for quadratic forms of independent subgaussian random variables is generalized to the case of negatively dependent subgaussian random variables.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2017

On Equivalence of Martingale Tail Bounds and Deterministic Regret Inequalities

نویسندگان

چکیده

منابع مشابه

Concentration Inequalities for Dependent Random Variables via the Martingale Method

A Trajectorial Interpretation of Doob’s Martingale Inequalities

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

On joint distributions of the maximum, minimum and terminal value of a continuous uniformly integrable martingale

Some Probability Inequalities for Quadratic Forms of Negatively Dependent Subgaussian Random Variables

عنوان ژورنال:

اشتراک گذاری